SEQUENCE LISTING 



(1) GENERAL INFORMATION:

(i) APPLICANT: TANABE, Tadashi

(ii) TITLE OF INVENTION: PROSTACYCLIN SYNTHASE DERIVED FROM HUMAN

(iii) NUMBER OF SEQUENCES: 17

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: SUGHRUE, MION, ZINN, MACPEAK & SEAS
(B) STREET: 2100 Pennsylvania Avenue, N.W.

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20037

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/578,709 

(B) FILING DATE: 28-DEC-1995 

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/JP95/00838 

(B) FILING DATE: 27-APR-1995 

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: JP 114316/1994
(B) FILING DATE: 28-APR-1994

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Gubinsky, Louis 

(B) REGISTRATION NUMBER: 24,835 

(C) REFERENCE/DOCKET NUMBER: Q40439

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (202)293-7060
(B) TELEFAX: (202)293-7860



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "PRIMER/SYNTHETIC DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GGGACAAGGA CCACAT

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "PRIMER/SYNTHETIC DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
CAAAAGTCGC CTGTGGAAGC

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "PRIMER/SYNTHETIC DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
CACAGGCGAC TTTTGACA

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "PRIMER/SYNTHETIC DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
TGCCTGCATC TCCTCTGA 

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "PRIMER/SYNTHETIC DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GACTCGAGTC GACATCGATT TTTTTTTTTT TTTTT

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "PRIMER/SYNTHETIC DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
GACTCGAGTC GACATCG
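
The LENGTH values declared for the primers of SEQ ID NOS: 1 to 6 can be cross-checked mechanically. The following is a minimal Python sketch and is not part of the filed listing; the names `PRIMERS`, `clean` and `length_mismatches` are hypothetical and introduced only for illustration. It also checks that SEQ ID NO: 6 matches the first 17 bases of SEQ ID NO: 5 and counts the run of T residues at the 3' end of SEQ ID NO: 5.

```python
# Illustrative check only; all names here are hypothetical, not from the listing.
# Each entry holds the sequence as printed and its declared LENGTH.
PRIMERS = {
    1: ("GGGACAAGGA CCACAT", 16),
    2: ("CAAAAGTCGC CTGTGGAAGC", 20),
    3: ("CACAGGCGAC TTTTGACA", 18),
    4: ("TGCCTGCATC TCCTCTGA", 18),
    5: ("GACTCGAGTC GACATCGATT TTTTTTTTTT TTTTT", 35),
    6: ("GACTCGAGTC GACATCG", 17),
}


def clean(seq: str) -> str:
    """Remove the blocks-of-ten spacing used in the listing."""
    return "".join(seq.split()).upper()


def length_mismatches(primers):
    """Return the SEQ ID numbers whose actual length differs from LENGTH."""
    return [n for n, (seq, declared) in primers.items()
            if len(clean(seq)) != declared]


seq5 = clean(PRIMERS[5][0])
seq6 = clean(PRIMERS[6][0])

# Number of consecutive T residues at the 3' end of SEQ ID NO: 5.
poly_t_tail = len(seq5) - len(seq5.rstrip("T"))

print("length mismatches:", length_mismatches(PRIMERS))
print("SEQ 5 starts with SEQ 6:", seq5.startswith(seq6))
print("3' T run in SEQ 5:", poly_t_tail)
```

An empty mismatch list means every primer agrees with its declared length.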

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "PRIMER/SYNTHETIC RNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:



CUACUACUAC UAGGCCACGC GTCGACTAGT ACGGGNNGGG NNGGGNNG 



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 183 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(B) STRAIN: lambda hPGIS141

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 1..183

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

GGG GAC AAG GAC CAC ATG TGC AGT GTC AAA AGT CGC CTG TGG AAG CTG
Gly Asp Lys Asp His Met Cys Ser Val Lys Ser Arg Leu Trp Lys Leu
1 5 10 15

CTA TCC CCA GCC AGG CTG GCC AGG CGG GCC CAC CGG AGC AAA TGG CTG
Leu Ser Pro Ala Arg Leu Ala Arg Arg Ala His Arg Ser Lys Trp Leu
20 25 30

GAG AGT TAC CTG CTG CAC CTG GAG GAG ATG GGT GTG TCA GAG GAG ATG
Glu Ser Tyr Leu Leu His Leu Glu Glu Met Gly Val Ser Glu Glu Met
35 40 45

CAG GCA CGG GCC CTG GTG CTG CAG CTG TGG GCC ACA CAG
Gln Ala Arg Ala Leu Val Leu Gln Leu Trp Ala Thr Gln
50 55 60

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 61 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:



Gly Asp Lys Asp His Met Cys Ser Val Lys Ser Arg Leu Trp Lys Leu
1 5 10 15

Leu Ser Pro Ala Arg Leu Ala Arg Arg Ala His Arg Ser Lys Trp Leu
20 25 30

Glu Ser Tyr Leu Leu His Leu Glu Glu Met Gly Val Ser Glu Glu Met
35 40 45

Gln Ala Arg Ala Leu Val Leu Gln Leu Trp Ala Thr Gln
50 55 60
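
The CDS feature of SEQ ID NO: 8 (LOCATION 1..183) and the protein of SEQ ID NO: 9 should correspond codon by codon. The following Python sketch, which is not part of the filed listing, translates the 183 bases with the standard genetic code and compares the result with SEQ ID NO: 9 written in standard three-letter codes. The names `CODON_TABLE`, `THREE_LETTER`, `SEQ8_CDS`, `SEQ9` and `translate` are hypothetical and chosen only for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated with
# the bases in the order T, C, A, G. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# One-letter to three-letter residue codes, matching the listing's notation.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}

# Nucleotide sequence of SEQ ID NO: 8 as printed in the listing.
SEQ8_CDS = (
    "GGG GAC AAG GAC CAC ATG TGC AGT GTC AAA AGT CGC CTG TGG AAG CTG "
    "CTA TCC CCA GCC AGG CTG GCC AGG CGG GCC CAC CGG AGC AAA TGG CTG "
    "GAG AGT TAC CTG CTG CAC CTG GAG GAG ATG GGT GTG TCA GAG GAG ATG "
    "CAG GCA CGG GCC CTG GTG CTG CAG CTG TGG GCC ACA CAG"
)

# Amino acid sequence of SEQ ID NO: 9 in standard three-letter codes.
SEQ9 = (
    "Gly Asp Lys Asp His Met Cys Ser Val Lys Ser Arg Leu Trp Lys Leu "
    "Leu Ser Pro Ala Arg Leu Ala Arg Arg Ala His Arg Ser Lys Trp Leu "
    "Glu Ser Tyr Leu Leu His Leu Glu Glu Met Gly Val Ser Glu Glu Met "
    "Gln Ala Arg Ala Leu Val Leu Gln Leu Trp Ala Thr Gln"
)


def translate(dna: str) -> list:
    """Translate a DNA string (whitespace ignored) into three-letter residues."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]]
            for i in range(0, len(dna), 3)]


protein = translate(SEQ8_CDS)
print("residues:", len(protein))
print("matches SEQ ID NO: 9:", protein == SEQ9.split())
```

The same helper could be pointed at the coding regions of SEQ ID NOS: 10, 12 and 14, provided the bases before the stated CDS start are removed first.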



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 792 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(B) STRAIN: pHPGIS36

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 80

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 80..790



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTACTACTAC TAGGCCACGC GTCGACTAGT ACGGGGGGGG GGGGGGGGGG GCAGCCCCGC

CAGCCCCGCC AGCCCCGCG ATG GCT TGG GCC GCG CTC CTC GGC CTC CTG GCC
Met Ala Trp Ala Ala Leu Leu Gly Leu Leu Ala

GCA CTG TTG CTG CTG CTG CTA CTG AGC CGC CGC CGC ACG CGG CGA CCT
Ala Leu Leu Leu Leu Leu Leu Leu Ser Arg Arg Arg Thr Arg Arg Pro

GGT GAG CCT CCC CTG GAC CTG GGC AGC ATC CCC TGG TTG GGG TAT GCC
Gly Glu Pro Pro Leu Asp Leu Gly Ser Ile Pro Trp Leu Gly Tyr Ala

TTG GAC TTT GGA AAA GAT GCT GCC AGC TTC CTC ACG AGG ATG AAG GAG
Leu Asp Phe Gly Lys Asp Ala Ala Ser Phe Leu Thr Arg Met Lys Glu

AAG CAC GGT GAC ATC TTT ACT ATA CTG GTT GGG GGC AGG TAT GTC ACC
Lys His Gly Asp Ile Phe Thr Ile Leu Val Gly Gly Arg Tyr Val Thr

GTT CTC CTG GAC CCA CAC TCC TAC GAC GCG GTG GTG TGG GAG CCT CGC 352
Val Leu Leu Asp Pro His Ser Tyr Asp Ala Val Val Trp Glu Pro Arg
80 85 90

ACC AGG CTG GAC TTC CAT GCC TAT GCC ATC TTC CTC ATG GAG AGG ATT
Thr Arg Leu Asp Phe His Ala Tyr Ala Ile Phe Leu Met Glu Arg Ile
95 100 105

TTT GAT GTG CAG CTT CCA CAT TAC AGC CCC AGT GAT GAA AAG GCC AGG
Phe Asp Val Gln Leu Pro His Tyr Ser Pro Ser Asp Glu Lys Ala Arg
110 115 120

ATG AAA CTG ACT CTT CTC GAC AGA GAG CTC CAG GCA CTC ACA GAA GCC
Met Lys Leu Thr Leu Leu His Arg Glu Leu Gln Ala Leu Thr Glu Ala
125 130 135

ATG TAT ACC AAC CTC CAT GCA GTG CTG TTG GGC GAT GCT ACA GAA GCA
Met Tyr Thr Asn Leu His Ala Val Leu Leu Gly Asp Ala Thr Glu Ala
140 145 150 155

GGC AGT GGC TGG CAC GAG ATG GGT CTC CTC GAC TTC TCC TAC AGC TTC
Gly Ser Gly Trp His Glu Met Gly Leu Leu Asp Phe Ser Tyr Ser Phe
160 165 170

CTG CTC AGA GCC GGC TAC CTG ACT CTT TAC GGA ATT GAG GCG CTG CCA
Leu Leu Arg Ala Gly Tyr Leu Thr Leu Tyr Gly Ile Glu Ala Leu Pro
175 180 185

CGC ACC CAT GAA AGC CAG GCC CAG GAC CGC GTC CAC TCA GCT GAT GTC
Arg Thr His Glu Ser Gln Ala Gln Asp Arg Val His Ser Ala Asp Val
190 195 200

TTC CAC ACC TTT CGC CAG CTC GAC CGG CTG CTC CCC AAA CTG GCC CGT
Phe His Thr Phe Arg Gln Leu Asp Arg Leu Leu Pro Lys Leu Ala Arg
205 210 215

GGC TCC CTG TCA GTG GGG GAC AAG GAC CAC ATG TGC AGT GTC AAA AGT
Gly Ser Leu Ser Val Gly Asp Lys Asp His Met Cys Ser Val Lys Ser
220 225 230 235

CGC CTG TG
Arg Leu



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Ala Trp Ala Ala Leu Leu Gly Leu Leu Ala Ala Leu Leu Leu Leu
1 5 10 15

Leu Leu Leu Ser Arg Arg Arg Thr Arg Arg Pro Gly Glu Pro Pro Leu
20 25 30

Asp Leu Gly Ser Ile Pro Trp Leu Gly Tyr Ala Leu Asp Phe Gly Lys
35 40 45

Asp Ala Ala Ser Phe Leu Thr Arg Met Lys Glu Lys His Gly Asp Ile
50 55 60

Phe Thr Ile Leu Val Gly Gly Arg Tyr Val Thr Val Leu Leu Asp Pro
65 70 75 80

His Ser Tyr Asp Ala Val Val Trp Glu Pro Arg Thr Arg Leu Asp Phe
85 90 95

His Ala Tyr Ala Ile Phe Leu Met Glu Arg Ile Phe Asp Val Gln Leu
100 105 110

Pro His Tyr Ser Pro Ser Asp Glu Lys Ala Arg Met Lys Leu Thr Leu
115 120 125

Leu His Arg Glu Leu Gln Ala Leu Thr Glu Ala Met Tyr Thr Asn Leu
130 135 140

His Ala Val Leu Leu Gly Asp Ala Thr Glu Ala Gly Ser Gly Trp His
145 150 155 160

Glu Met Gly Leu Leu Asp Phe Ser Tyr Ser Phe Leu Leu Arg Ala Gly
165 170 175

Tyr Leu Thr Leu Tyr Gly Ile Glu Ala Leu Pro Arg Thr His Glu Ser
180 185 190

Gln Ala Gln Asp Arg Val His Ser Ala Asp Val Phe His Thr Phe Arg
195 200 205

Gln Leu Asp Arg Leu Leu Pro Lys Leu Ala Arg Gly Ser Leu Ser Val
210 215 220

Gly Asp Lys Asp His Met Cys Ser Val Lys Ser Arg Leu
225 230 235

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1296 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(B) STRAIN: pHPGIS135

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..827

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 3..827

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:



GG GAC AAG GAC CAC ATG TGC AGT GTC AAA AGT CGC CTG TGG AAG CTG
Asp Lys Asp His Met Cys Ser Val Lys Ser Arg Leu Trp Lys Leu

CTA TCC CCA GCC AGG CTG GCC AGG CGG GCC CAC CGG AGC AAA TGG CTG
Leu Ser Pro Ala Arg Leu Ala Arg Arg Ala His Arg Ser Lys Trp Leu

GAG AGT TAC CTG CTG CAC CTG GAG GAG ATG GGT GTG TCA GAG GAG ATG
Glu Ser Tyr Leu Leu His Leu Glu Glu Met Gly Val Ser Glu Glu Met

CAG GCA CGG GCC CTG GTG CTG CAG CTG TGG GCC ACA CAG GGG AAT ATG
Gln Ala Arg Ala Leu Val Leu Gln Leu Trp Ala Thr Gln Gly Asn Met

GGT CCC GCT GCC TTC TGG CTC CTG CTC TTC CTT CTC AAG AAT CCT GAA
Gly Pro Ala Ala Phe Trp Leu Leu Leu Phe Leu Leu Lys Asn Pro Glu

GCC CTG GCT GCT GTC CGC GGA GAG CTC GAG AGT ATC CTT TGG CAA GCG
Ala Leu Ala Ala Val Arg Gly Glu Leu Glu Ser Ile Leu Trp Gln Ala

GAG CAG CCT GTC TCG CAG ACG ACC ACT CTC CCA CAG AAG GTT CTA GAC
Glu Gln Pro Val Ser Gln Thr Thr Thr Leu Pro Gln Lys Val Leu Asp
100 105 110

AGC ACA CCT GTG CTT GAT AGC GTG CTG AGT GAG AGC CTC AGG CTT ACA
Ser Thr Pro Val Leu Asp Ser Val Leu Ser Glu Ser Leu Arg Leu Thr
115 120 125

GCT GCC CCC TTC ATC ACC CGC GAG GTT GTG GTG GAC CTG GCC ATG CCC
Ala Ala Pro Phe Ile Thr Arg Glu Val Val Val Asp Leu Ala Met Pro
130 135 140

ATG GCA GAC GGG AGA GAA TTC AAC CTG CGA CGT GGT GAC CGC CTC CTC
Met Ala Asp Gly Arg Glu Phe Asn Leu Arg Arg Gly Asp Arg Leu Leu
145 150 155

CTC TTC CCC TTC CTG AGC CCC CAG AGA GAC CCA GAA ATC TAC ACA GAC
Leu Phe Pro Phe Leu Ser Pro Gln Arg Asp Pro Glu Ile Tyr Thr Asp
160 165 170 175

CCA GAG GTA TTT AAA TAC AAC CGA TTC CTG AAC CCT GAC GGA TCA GAG
Pro Glu Val Phe Lys Tyr Asn Arg Phe Leu Asn Pro Asp Gly Ser Glu
180 185 190

AAG AAA GAC TTT TAC AAG GAT GGG AAA CGG CTG AAG AAT TAC AAC ATG 623
Lys Lys Asp Phe Tyr Lys Asp Gly Lys Arg Leu Lys Asn Tyr Asn Met
195 200 205

CCC TGG GGG GCG GGG CAC AAT CAC TGC CTG GGG AGG AGT TAT GCG GTC 671
Pro Trp Gly Ala Gly His Asn His Cys Leu Gly Arg Ser Tyr Ala Val
210 215 220

AAC AGC ATC AAA CAA TTT GTG TTC CTT GTG CTG GTG CAC TTG GAC TTG 719
Asn Ser Ile Lys Gln Phe Val Phe Leu Val Leu Val His Leu Asp Leu
225 230 235

GAG CTG ATC AAC GCA GAT GTG GAG ATC CCT GAG TTT GAC CTC AGC AGG 767
Glu Leu Ile Asn Ala Asp Val Glu Ile Pro Glu Phe Asp Leu Ser Arg
240 245 250 255

TAC GGC TTC GGT CTG ATG CAG CCG GAA CAC GAC GTG CCC GTC CGC TAC 815
Tyr Gly Phe Gly Leu Met Gln Pro Glu His Asp Val Pro Val Arg Tyr
260 265 270

CGC ATC CGC CCA TGACACAGGG AGCAGATGGA TCCACGTGCT CGCCTCTGCC 867
Arg Ile Arg Pro
275

CAGCCTGCCC CAGCCTGCCC CAGCCTCCCA GCTTTCTGTG TGCACAGTTG GCCCGGGTGC 927

AGGTGCTAGC ATTACCACTT CCCTGCTTTT CTCCCAGAAG GCTGGGTCCA GGGGAGGGAA 987

AAGCTAAGAG GGTGAACAAA GAAAAGACAT TGAAAGCTCT ATGGATTATC CACTGCAAAG 1047

TTTTCTTTCC AAAATCAGGC TTTGTCTGCT CCCAATTCAC CTCGTTACTC TCACCTCGTG 1107

ATATCCACAA ATGCTATTCA GATAAGGCAG AACTAGGAGT CTTCACTGCT CTGCCCCCAA 1167

CTCCCGGAGG TGTCACCTTC CTAGTTCTTA TGAGCTAGCA TGGCCCGGGC CTTATCCAGT 1227

CAAAGCGGAT GCTGGCCACA GAAAGGCCAC TCAGGATGTC CTTTGTGTCC ATCGATGTCG 1287

ACTCGAGTC 1296



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asp Lys Asp His Met Cys Ser Val Lys Ser Arg Leu Trp Lys Leu Leu
1 5 10 15

Ser Pro Ala Arg Leu Ala Arg Arg Ala His Arg Ser Lys Trp Leu Glu
20 25 30

Ser Tyr Leu Leu His Leu Glu Glu Met Gly Val Ser Glu Glu Met Gln
35 40 45

Ala Arg Ala Leu Val Leu Gln Leu Trp Ala Thr Gln Gly Asn Met Gly
50 55 60

Pro Ala Ala Phe Trp Leu Leu Leu Phe Leu Leu Lys Asn Pro Glu Ala
65 70 75 80

Leu Ala Ala Val Arg Gly Glu Leu Glu Ser Ile Leu Trp Gln Ala Glu
85 90 95

Gln Pro Val Ser Gln Thr Thr Thr Leu Pro Gln Lys Val Leu Asp Ser
100 105 110

Thr Pro Val Leu Asp Ser Val Leu Ser Glu Ser Leu Arg Leu Thr Ala
115 120 125

Ala Pro Phe Ile Thr Arg Glu Val Val Val Asp Leu Ala Met Pro Met
130 135 140

Ala Asp Gly Arg Glu Phe Asn Leu Arg Arg Gly Asp Arg Leu Leu Leu
145 150 155 160

Phe Pro Phe Leu Ser Pro Gln Arg Asp Pro Glu Ile Tyr Thr Asp Pro
165 170 175

Glu Val Phe Lys Tyr Asn Arg Phe Leu Asn Pro Asp Gly Ser Glu Lys
180 185 190

Lys Asp Phe Tyr Lys Asp Gly Lys Arg Leu Lys Asn Tyr Asn Met Pro
195 200 205

Trp Gly Ala Gly His Asn His Cys Leu Gly Arg Ser Tyr Ala Val Asn
210 215 220

Ser Ile Lys Gln Phe Val Phe Leu Val Leu Val His Leu Asp Leu Glu
225 230 235 240

Leu Ile Asn Ala Asp Val Glu Ile Pro Glu Phe Asp Leu Ser Arg Tyr
245 250 255

Gly Phe Gly Leu Met Gln Pro Glu His Asp Val Pro Val Arg Tyr Arg
260 265 270

Ile Arg Pro
275

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1977 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 28..1527

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 28..1527

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

AGCCCCGCCA GCCCCGCCAG CCCCGCG ATG GCT TGG GCC GCG CTC CTC GGC
Met Ala Trp Ala Ala Leu Leu Gly

CTC CTG GCC GCA CTG TTG CTG CTG CTG CTA CTG AGC CGC CGC CGC ACG
Leu Leu Ala Ala Leu Leu Leu Leu Leu Leu Leu Ser Arg Arg Arg Thr

CGG CGA CCT GGT GAG CCT CCC CTG GAC CTG GGC AGC ATC CCC TGG TTG
Arg Arg Pro Gly Glu Pro Pro Leu Asp Leu Gly Ser Ile Pro Trp Leu

GGG TAT GCC TTG GAC TTT GGA AAA GAT GCT GCC AGC TTC CTC ACG AGG
Gly Tyr Ala Leu Asp Phe Gly Lys Asp Ala Ala Ser Phe Leu Thr Arg

ATG AAG GAG AAG CAC GGT GAC ATC TTT ACT ATA CTG GTT GGG GGC AGG
Met Lys Glu Lys His Gly Asp Ile Phe Thr Ile Leu Val Gly Gly Arg

TAT GTC ACC GTT CTC CTG GAC CCA CAC TCC TAC GAC GCG GTG GTG TGG
Tyr Val Thr Val Leu Leu Asp Pro His Ser Tyr Asp Ala Val Val Trp

GAG CCT CGC ACC AGG CTC GAC TTC CAT GCC TAT GCC ATC TTC CTC ATG
Glu Pro Arg Thr Arg Leu Asp Phe His Ala Tyr Ala Ile Phe Leu Met
90 95 100

GAG AGG ATT TTT GAT GTG CAG CTT CCA CAT TAC AGC CCC AGT GAT GAA
Glu Arg Ile Phe Asp Val Gln Leu Pro His Tyr Ser Pro Ser Asp Glu
105 110 115 120

AAG GCC AGG ATG AAA CTG ACT CTT CTC GAC AGA GAG CTC CAG GCA CTC
Lys Ala Arg Met Lys Leu Thr Leu Leu His Arg Glu Leu Gln Ala Leu
125 130 135

ACA GAA GCC ATG TAT ACC AAC CTC CAT GCA GTG CTG TTG GGC GAT GCT 
Thr Glu Ala Met Tyr Thr Asn Leu His Ala Val Leu Leu Gly Asp Ala 
140 145 150 

ACA GAA GCA GGC AGT GGC TGG CAC GAG ATG GGT CTC CTC GAC TTC TCC
Thr Glu Ala Gly Ser Gly Trp His Glu Met Gly Leu Leu Asp Phe Ser
155 160 165

TAC AGC TTC CTG CTC AGA GCC GGC TAC CTG ACT CTT TAC GGA ATT GAG 579
Tyr Ser Phe Leu Leu Arg Ala Gly Tyr Leu Thr Leu Tyr Gly Ile Glu
170 175 180

GCG CTG CCA CGG ACC CAT GAA AGC CAG GCC CAG GAC CGC GTC CAC TCA 627
Ala Leu Pro Arg Thr His Glu Ser Gln Ala Gln Asp Arg Val His Ser
185 190 195 200

GCT GAT GTC TTC CAC ACC TTT CGC CAG CTC GAC CGG CTG CTC CCC AAA 675
Ala Asp Val Phe His Thr Phe Arg Gln Leu Asp Arg Leu Leu Pro Lys
205 210 215

CTG GCC CGT GGC TCC CTG TCA GTG GGG GAC AAG GAC CAC ATG TGC AGT 723
Leu Ala Arg Gly Ser Leu Ser Val Gly Asp Lys Asp His Met Cys Ser
220 225 230

GTC AAA AGT CGC CTG TGG AAG CTG CTA TCC CCA GCC AGG CTG GCC AGG 771
Val Lys Ser Arg Leu Trp Lys Leu Leu Ser Pro Ala Arg Leu Ala Arg
235 240 245

CGG GCC CAC CGG AGC AAA TGG CTG GAG AGT TAC CTG CTG CAC CTG GAG 819
Arg Ala His Arg Ser Lys Trp Leu Glu Ser Tyr Leu Leu His Leu Glu
250 255 260

GAG ATG GGT GTG TCA GAG GAG ATG CAG GCA CGG GCC CTG GTG CTG CAG 867
Glu Met Gly Val Ser Glu Glu Met Gln Ala Arg Ala Leu Val Leu Gln
265 270 275 280

CTG TGG GCC ACA CAG GGG AAT ATG GGT CCC GCT GCC TTC TGG CTC CTG 915
Leu Trp Ala Thr Gln Gly Asn Met Gly Pro Ala Ala Phe Trp Leu Leu
285 290 295

CTC TTC CTT CTC AAG AAT CCT GAA GCC CTG GCT GCT GTC CGC GGA GAG 963
Leu Phe Leu Leu Lys Asn Pro Glu Ala Leu Ala Ala Val Arg Gly Glu
300 305 310

CTC GAG AGT ATC CTT TGG CAA GCG GAG CAG CCT GTC TCG CAG ACG ACC 1011
Leu Glu Ser Ile Leu Trp Gln Ala Glu Gln Pro Val Ser Gln Thr Thr
315 320 325

ACT CTC CCA CAG AAG GTT CTA GAC AGC ACA CCT GTG CTT GAT AGC GTG 1059
Thr Leu Pro Gln Lys Val Leu Asp Ser Thr Pro Val Leu Asp Ser Val
330 335 340

CTG AGT GAG AGC CTC AGG CTT ACA GCT GCC CCC TTC ATC ACC CGC GAG 1107
Leu Ser Glu Ser Leu Arg Leu Thr Ala Ala Pro Phe Ile Thr Arg Glu
345 350 355 360

GTT GTG GTG GAC CTG GCC ATG CCC ATG GCA GAC GGG AGA GAA TTC AAC 1155
Val Val Val Asp Leu Ala Met Pro Met Ala Asp Gly Arg Glu Phe Asn
365 370 375

CTG CGA CGT GGT GAC CGC CTC CTC CTC TTC CCC TTC CTG AGC CCC CAG 1203
Leu Arg Arg Gly Asp Arg Leu Leu Leu Phe Pro Phe Leu Ser Pro Gln
380 385 390



AGA GAC CCA GAA ATC TAC ACA GAC CCA GAG GTA TTT AAA TAC AAC CGA 1251
Arg Asp Pro Glu Ile Tyr Thr Asp Pro Glu Val Phe Lys Tyr Asn Arg
395 400 405

TTC CTG AAC CCT GAC GGA TCA GAG AAG AAA GAC TTT TAC AAG GAT GGG 1299
Phe Leu Asn Pro Asp Gly Ser Glu Lys Lys Asp Phe Tyr Lys Asp Gly
410 415 420

AAA CGG CTG AAG AAT TAC AAC ATG CCC TGG GGG GCG GGG CAC AAT CAC 1347
Lys Arg Leu Lys Asn Tyr Asn Met Pro Trp Gly Ala Gly His Asn His
425 430 435 440

TGC CTG GGG AGG AGT TAT GCG GTC AAC AGC ATC AAA CAA TTT GTG TTC 1395
Cys Leu Gly Arg Ser Tyr Ala Val Asn Ser Ile Lys Gln Phe Val Phe
445 450 455

CTT GTG CTG GTG CAC TTG GAC TTG GAG CTG ATC AAC GCA GAT GTG GAG 1443
Leu Val Leu Val His Leu Asp Leu Glu Leu Ile Asn Ala Asp Val Glu
460 465 470

ATC CCT GAG TTT GAC CTC AGC AGG TAC GGC TTC GGT CTG ATG CAG CCG 1491
Ile Pro Glu Phe Asp Leu Ser Arg Tyr Gly Phe Gly Leu Met Gln Pro
475 480 485

GAA CAC GAC GTG CCC GTC CGC TAC CGC ATC CGC CCA TGACACAGGG 1537
Glu His Asp Val Pro Val Arg Tyr Arg Ile Arg Pro
490 495 500

AGCAGATGGA TCCACGTGCT CGCCTCTGCC CAGCCTGCCC CAGCCTGCCC CAGCCTCCCA 1597

GCTTTCTGTG TGCACAGTTG GCCCGGGTGC AGGTGCTAGC ATTACCACTT CCCTGCTTTT 1657

CTCCCAGAAG GCTGGGTCCA GGGGAGGGAA AAGCTAAGAG GGTGAACAAA GAAAAGACAT 1717

TGAAAGCTCT ATGGATTATC CACTGCAAAG TTTTCTTTCC AAAATCAGGC TTTGTCTGCT 1777

CCCAATTCAC CTCGTTACTC TCACCTCGTG ATATCCACAA ATGCTATTCA GATAAGGCAG 1837

AACTAGGAGT CTTCACTGCT CTGCCCCCAA CTCCCGGAGG TGTCACCTTC CTAGTTCTTA 1897 

TGAGCTAGCA TGGCCCGGGC CTTATCCAGT CAAAGCGGAT GCTGGCCACA GAAAGGCCAC 1957 

TCAGGATGTC CTTTGTGTCC 1977 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Met Ala Trp Ala Ala Leu Leu Gly Leu Leu Ala Ala Leu Leu Leu Leu
1 5 10 15

Leu Leu Leu Ser Arg Arg Arg Thr Arg Arg Pro Gly Glu Pro Pro Leu
20 25 30

Asp Leu Gly Ser Ile Pro Trp Leu Gly Tyr Ala Leu Asp Phe Gly Lys
35 40 45

Asp Ala Ala Ser Phe Leu Thr Arg Met Lys Glu Lys His Gly Asp Ile
50 55 60

Phe Thr Ile Leu Val Gly Gly Arg Tyr Val Thr Val Leu Leu Asp Pro
65 70 75 80

His Ser Tyr Asp Ala Val Val Trp Glu Pro Arg Thr Arg Leu Asp Phe
85 90 95

His Ala Tyr Ala Ile Phe Leu Met Glu Arg Ile Phe Asp Val Gln Leu
100 105 110

Pro His Tyr Ser Pro Ser Asp Glu Lys Ala Arg Met Lys Leu Thr Leu
115 120 125

Leu His Arg Glu Leu Gln Ala Leu Thr Glu Ala Met Tyr Thr Asn Leu
130 135 140

His Ala Val Leu Leu Gly Asp Ala Thr Glu Ala Gly Ser Gly Trp His
145 150 155 160

Glu Met Gly Leu Leu Asp Phe Ser Tyr Ser Phe Leu Leu Arg Ala Gly
165 170 175

Tyr Leu Thr Leu Tyr Gly Ile Glu Ala Leu Pro Arg Thr His Glu Ser
180 185 190

Gln Ala Gln Asp Arg Val His Ser Ala Asp Val Phe His Thr Phe Arg
195 200 205

Gln Leu Asp Arg Leu Leu Pro Lys Leu Ala Arg Gly Ser Leu Ser Val
210 215 220

Gly Asp Lys Asp His Met Cys Ser Val Lys Ser Arg Leu Trp Lys Leu
225 230 235 240

Leu Ser Pro Ala Arg Leu Ala Arg Arg Ala His Arg Ser Lys Trp Leu
245 250 255

Glu Ser Tyr Leu Leu His Leu Glu Glu Met Gly Val Ser Glu Glu Met
260 265 270

Gln Ala Arg Ala Leu Val Leu Gln Leu Trp Ala Thr Gln Gly Asn Met
275 280 285

Gly Pro Ala Ala Phe Trp Leu Leu Leu Phe Leu Leu Lys Asn Pro Glu
290 295 300

Ala Leu Ala Ala Val Arg Gly Glu Leu Glu Ser Ile Leu Trp Gln Ala
305 310 315 320

Glu Gln Pro Val Ser Gln Thr Thr Thr Leu Pro Gln Lys Val Leu Asp
325 330 335

Ser Thr Pro Val Leu Asp Ser Val Leu Ser Glu Ser Leu Arg Leu Thr
340 345 350

Ala Ala Pro Phe Ile Thr Arg Glu Val Val Val Asp Leu Ala Met Pro
355 360 365

Met Ala Asp Gly Arg Glu Phe Asn Leu Arg Arg Gly Asp Arg Leu Leu
370 375 380

Leu Phe Pro Phe Leu Ser Pro Gln Arg Asp Pro Glu Ile Tyr Thr Asp
385 390 395 400

Pro Glu Val Phe Lys Tyr Asn Arg Phe Leu Asn Pro Asp Gly Ser Glu
405 410 415

Lys Lys Asp Phe Tyr Lys Asp Gly Lys Arg Leu Lys Asn Tyr Asn Met
420 425 430

Pro Trp Gly Ala Gly His Asn His Cys Leu Gly Arg Ser Tyr Ala Val
435 440 445

Asn Ser Ile Lys Gln Phe Val Phe Leu Val Leu Val His Leu Asp Leu
450 455 460

Glu Leu Ile Asn Ala Asp Val Glu Ile Pro Glu Phe Asp Leu Ser Arg
465 470 475 480

Tyr Gly Phe Gly Leu Met Gln Pro Glu His Asp Val Pro Val Arg Tyr
485 490 495

Arg Ile Arg Pro
500
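
The feature locations stated for the nucleotide sequences can be checked against the lengths declared for the corresponding protein sequences, since a coding span of n bases with no stop codon included corresponds to n/3 residues. The following Python sketch is not part of the filed listing; `FEATURES` and `residues_encoded` are hypothetical names used only for illustration. For SEQ ID NO: 10 the mat_peptide location is used, because the CDS location is given only by its start position.

```python
# Illustrative arithmetic check; names are hypothetical, not from the listing.
# Each entry: nucleotide SEQ ID NO -> (start, end, protein SEQ ID NO,
#                                      declared protein LENGTH).
FEATURES = {
    8: (1, 183, 9, 61),
    10: (80, 790, 11, 237),    # mat_peptide location
    12: (3, 827, 13, 275),
    14: (28, 1527, 15, 500),
}


def residues_encoded(start: int, end: int) -> int:
    """Number of whole codons in the inclusive 1-based span start..end."""
    span = end - start + 1
    if span % 3 != 0:
        raise ValueError(f"span {start}..{end} is not a whole number of codons")
    return span // 3


for nuc_id, (start, end, prot_id, declared) in FEATURES.items():
    encoded = residues_encoded(start, end)
    status = "consistent" if encoded == declared else "MISMATCH"
    print(f"SEQ ID NO: {nuc_id} {start}..{end} -> {encoded} residues; "
          f"SEQ ID NO: {prot_id} declares {declared}: {status}")
```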

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "PRIMER/SYNTHETIC DNA"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GACAAGGACC ACATGTGCAG TGTC 



(2) INFORMATION FOR SEQ ID NO:17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "PRIMER/SYNTHETIC DNA"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CTGTGTGGCC CACAGCTGCA GCAC 



